Applying ontology design patterns to the implementation of relations in GENIA
Abstract
Motivation: Annotated reference corpora such as the GENIA corpus play an important role in biomedical information extraction. Semantically annotating the natural language texts in these reference corpora using formal ontologies and logic is challenging because natural language and its semantics are used ambiguously. Providing formal definitions and axioms for the relations used in these annotations would offer a means of developing consistent and verifiable annotation guidelines, allow annotations to be verified automatically, and enable the discovery of new information through deductive inference.
Results: We developed a formal ontology of relations based on the relations used in the recent GENIA corpus annotations. For this purpose, we selected existing axiom systems based on the desired properties of the relations within the domain and provided new axioms for several relations. To apply this ontology of relations to the semantic annotation of natural language texts, we developed and implemented two ontology design patterns. We provide an implementation of the ontology of relations in the Web Ontology Language (OWL). By combining the implementation of the design patterns with that of the relation ontology, we also provide a software application that converts annotated GENIA abstracts into OWL ontologies. In this way, we make these ontologies amenable to automated verification, deductive inference and other knowledge-based applications.
Availability: Documentation, implementation and examples are available from http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/.
Contact: [email protected]
Related resources
Ontology design patterns to disambiguate relations between genes and gene products in GENIA
Motivation: Annotated reference corpora play an important role in biomedical information extraction. A semantic annotation of the natural language texts in these reference corpora using formal ontologies is challenging due to the inherent ambiguity of natural language. The provision of formal definitions and axioms for semantic annotations offers the means for ensuring consistency as well as ena...
Context-aware Modeling for Spatio-temporal Data Transmitted from a Wireless Body Sensor Network
Context-aware systems must be interoperable and work across different platforms at any time and in any place. Context data collected from wireless body area networks (WBAN) may be heterogeneous and imperfect, which makes their design and implementation difficult. In this research, we introduce a model which takes the dynamic nature of a context-aware system into consideration. This model is con...
Unsupervised Learning of Semantic Relations between Concepts of a Molecular Biology Ontology
We present an unsupervised model for learning arbitrary relations between the concepts defined in a molecular biology ontology for the purpose of text data mining and support to manual ontology building. Relations are learned from the GENIA corpus, in which named-entities representing the GENIA ontology concepts have been tagged, by means of several natural language processing techniques. We ca...
Designing a semi-automatic ontology construction system using word co-occurrence analysis and the C-value method (case study: the field of scientometrics in Iran)
An ontology is a formal description of the concepts and relations in a specific domain. Recent work has tried to design automatic methods for ontology learning. Because an ontology contains concepts and relations, ontology learning involves extracting concepts and the semantic relations among them. Building ontologies for different domains and applications is an expensive process, which motivates automating it. The lack of main knowledg...
Enhancing a Biological Concept Ontology to Fuzzy Relational Ontology with Relations Mined from Text
In this paper we investigate the problem of enriching an existing biological concept ontology into a fuzzy relational ontology structure using generic biological relations and their strengths mined from tagged biological text documents. Though biological relations in a text are defined between a pair of entities, the entities are usually tagged by their concept names in a tagged corpus. Since t...